Research of Feature Weighting Method Based on Document Structure
نویسندگان
چکیده
منابع مشابه
Feature Weighting Method Based On Instance Correlation Using Discretization
In Machine Learning Process, several issues arise in identifying a suitable and quality set of features from which a classification model for a particular domain to be constructed. This paper addresses the problem of feature selection for machine learning through discretization approach. RELIEF is considered to be one of the most successful algorithms for assessing the quality of features. RELI...
متن کاملOverlap-based feature weighting: The feature extraction of Hyperspectral remote sensing imagery
Hyperspectral sensors provide a large number of spectral bands. This massive and complex data structure of hyperspectral images presents a challenge to traditional data processing techniques. Therefore, reducing the dimensionality of hyperspectral images without losing important information is a very important issue for the remote sensing community. We propose to use overlap-based feature weigh...
متن کاملDocument Clustering using Weighting and Labels based on Inherent Structure of Document
In classic document clustering, documents appear terms frequency without considering the semantic information of each document (i.e., vector model). The property of vector model may be incorrectly classified documents into different clusters when documents of same cluster lack the shared terms. Recently, to overcome this problem uses knowledge based approaches. However, these approaches have an...
متن کاملTerm weighting based on document revision history
In real-world information retrieval systems, the underlying document collection is rarely stable or definite. This work is focused on the study of signals extracted from the content of documents at different points in time for the purpose of weighting individual terms in a document. The basic idea behind our proposals is that terms that have existed for a longer time in a document should have a...
متن کاملFeature Selection Method Based on Improved Document Frequency
Feature selection is an important part of the process of text classification, there is a direct impact on the quality of feature selection because of the evaluation function. Document frequency (DF) is one of several commonly methods used feature selection, its shortcomings is the lack of theoretical basis on function construction, itwill tend to select high-frequency words in selecting. To sol...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: DEStech Transactions on Computer Science and Engineering
سال: 2017
ISSN: 2475-8841
DOI: 10.12783/dtcse/itms2016/9493